A survey of Shared-Nothing Parallel Database Management Systems

Author

  • Thomas Müseler
Abstract

Distributed database systems can be implemented in many different ways. Mostly, they are customized for a particular environment to handle big data problems. The data warehouse sector relies on these large data volumes, but in recent years it has changed from pure data storage to real-time management support [3]. The resulting increase in computation and storage capacity poses new requirements for database systems. Previous parallel database environments tried to solve this problem with shared-disk and shared-memory approaches. The main contribution of this paper is a presentation of the current technology in the shared-nothing database sector. The concepts of the manufacturers Teradata, Greenplum and Netezza are discussed with respect to data warehouse requirements. Building on an architectural overview, the paper gives a detailed insight into index functionality, which is a crucial performance factor. The manufacturers' data distribution algorithms are also analysed under data warehouse conditions. Finally, the paper compares shared-nothing with other concepts (shared-disk, shared-everything) and raises the question of whether the manufacturers can actually fulfil this approach.

Similar resources

A Performance Study of Locking Granularity in Shared-Nothing Parallel Database Systems

Locking granularity refers to the size of a lockable data unit, called a "granule", in a database system. Fine granularity improves system performance by increasing concurrency level but it also increases lock management overhead. Coarse granularity, on the other hand, sacrifices system performance but lowers the lock management cost. This paper explores the impact of granule size on performanc...

Affordable Parallel Database Dual Degree Project Stage - I Report

The rate of increase in database size and response time requirements has outpaced advancements in processor and mass storage technology. One way to satisfy the increasing demand for processing power and I/O bandwidth in database applications is to have a number of processors, loosely or tightly coupled, serving database requests concurrently. Technologies developed during the last decade have m...

Parallelising A Commercial DBMS

Research into DBMS (Database Management System) parallelism has been carried out to address the performance problems experienced in areas such as Decision Support. Distributed shared memory can alleviate the porting of commercial DBMSs to parallel platforms. However, this restricts the scalability and performance achievable using shared-nothing approaches. Consequently we propose the ...

Data Placement in a Shared-Nothing Parallel Deductive Database

Until recently most research into parallel databases has focussed on relational database systems. Nevertheless, there is growing interest in more powerful alternative systems such as deductive databases. Several rule handling strategies have been developed to incorporate deductive capabilities into parallel database systems. However, in a shared-nothing environment, the performance of a rule ha...

Data Placement in Parallel Database Systems

The way in which data is distributed across the processing elements of a parallel shared-nothing architecture can have a significant effect on the performance of a parallel DBMS. Data placement strategies provide a mechanical approach to determining a data distribution which will provide good performance. However, there is considerable variation in the results produced by different strategies and ...

Publication date: 2012